Finding outliers at multiple scales

نویسندگان

  • Tianming Hu
  • Sam Yuan Sung
چکیده

Outlier detection targets those exceptional data whose pattern is rare and lie in low density regions. In this paper, under the assumption of complete spatial randomness inside clusters, we propose an MDV (Multi-scale Deviation of the Volume) approach to identifying outliers. In addition to assigning an outlier score for each object, it directly outputs a crisp outlier set. It also offers a plot showing the data structure in every object’s vicinity, which is useful in explaining why it may be outlying. Finally, the effectiveness of MDV is demonstrated with both artificial and real datasets.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Stability Analysis of a Strongly Displacement Time-Delayed Duffing Oscillator Using Multiple Scales Homotopy Perturbation Method

In the present study, some perturbation methods are applied to Duffing equations having a displacement time-delayed variable to study the stability of such systems. Two approaches are considered to analyze Duffing oscillator having a strong delayed variable. The homotopy perturbation method is applied through the frequency analysis and nonlinear frequency is formulated as a function of all the ...

متن کامل

Finding Multiple Outliers from Multidimensional Data using Multiple Regression

The knowledge of weather is useful for finding climate change over a period. In this present frame work uses 15 years of weather of Hyderabad city , data a real time the datasets collected from weather station. Weather data is a time series and multidimensional data. Outliers are the objects whose behavior is different from the rest. Outliers in weather data represent the cyclone, drought, seas...

متن کامل

Identification of outliers types in multivariate time series using genetic algorithm

Multivariate time series data, often, modeled using vector autoregressive moving average (VARMA) model. But presence of outliers can violates the stationary assumption and may lead to wrong modeling, biased estimation of parameters and inaccurate prediction. Thus, detection of these points and how to deal properly with them, especially in relation to modeling and parameter estimation of VARMA m...

متن کامل

Who Should be Interviewed? A Response from Cluster Analysis

Objective: This article presents an application of cluster analysis for social sciences researches especially those studies that have an interview as part of their data collection. This application is more suitable for sequential mixed method researchers who use quantitative data to frame subsequent qualitative subsamples for conducting interviews.  Methods: In more detail, the algorithm (i....

متن کامل

Impact of Outliers in Data Envelopment ‎Analysis‎

This paper will examine the relationship between "Data Envelopment Analysis" and a statistical concept ``Outlier". Data envelopment analysis (DEA) is a method for estimating the relative efficiency of decision making units (DMUs) having similar tasks in a production system by multiple inputs to produce multiple ‎outputs.‎ An important issue in statistics is to identify the outliers. In this pap...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • International Journal of Information Technology and Decision Making

دوره 4  شماره 

صفحات  -

تاریخ انتشار 2005